Shor's algorithm

Shor's algorithm, named after mathematician Peter Shor, is a quantum algorithm (an algorithm which runs on a quantum computer) for integer factorization formulated in 1994. Informally it solves the following problem: Given an integer N, find its prime factors.

On a quantum computer, to factor an integer N, Shor's algorithm runs in polynomial time (the time taken is polynomial in log N, which is the size of the input).[1] Specifically it takes time O((log N)3), demonstrating that the integer factorization problem can be efficiently solved on a quantum computer and is thus in the complexity class BQP. This is exponentially faster than the most efficient known classical factoring algorithm, the general number field sieve, which works in sub-exponential time -- about O(e1.9 (log N)1/3 (log log N)2/3)(http://mathworld.wolfram.com/NumberFieldSieve.html). The efficiency lies in the efficiency of the quantum Fourier transform, and modular exponentiation by squarings.

Given a quantum computer with a sufficient number of qubits, Shor's algorithm can be used to break public-key cryptography schemes such as the widely used RSA scheme. RSA is based on the assumption that factoring large numbers is computationally infeasible. So far as is known, this assumption is valid for classical (non-quantum) computers; no classical algorithm is known that can factor in polynomial time. However, Shor's algorithm shows that factoring is efficient on a quantum computer, so an appropriately large quantum computer can break RSA. It was also a powerful motivator for the design and construction of quantum computers and for the study of new quantum computer algorithms. It has also facilitated research on new cryptosystems that are secure from quantum computers, collectively called post-quantum cryptography.

In 2001, Shor's algorithm was demonstrated by a group at IBM, who factored 15 into 3 × 5, using an NMR implementation of a quantum computer with 7 qubits.[2] However, some doubts have been raised as to whether IBM's experiment was a true demonstration of quantum computation, since no entanglement was observed.[3] Since IBM's implementation, several other groups have implemented Shor's algorithm using photonic qubits, emphasizing that entanglement was observed.[4][5]

Contents

Procedure

The problem we are trying to solve is: given an odd composite number N, find an integer d, strictly between 1 and N, that divides N. We are interested in odd values of N because any even value of N trivially has the number 2 as a prime factor. We can use a primality testing algorithm to make sure that N is indeed composite.

Moreover, for the algorithm to work, we need N not to be the power of a prime. This can be tested by taking square, cubic, ..., k-roots of N, for k \le \log_{2}(N), and checking that none of these is an integer. (This actually excludes that N = M^{k} for some integer M and k > 1.)

Since N is not a power of a prime, it is the product of two coprime numbers greater than 1. As a consequence of the Chinese remainder theorem, 1 has at least four distinct roots modulo N, two of them being 1 and -1. The aim of the algorithm is to find a square root b of 1, other than 1 and -1; such a b will lead to a factorization of N, as in other factoring algorithms like the quadratic sieve.

In turn, finding such a b is reduced to finding an element a of even period with a certain additional property (as explained below, it is required that the condition of Step 6 of the classical part does not hold). The quantum algorithm is used for finding the period of randomly chosen elements a, as order-finding is a hard problem on a classical computer.

Shor's algorithm consists of two parts:

  1. A reduction, which can be done on a classical computer, of the factoring problem to the problem of order-finding.
  2. A quantum algorithm to solve the order-finding problem.

Classical part

  1. Pick a random number a < N
  2. Compute gcd(a, N). This may be done using the Euclidean algorithm.
  3. If gcd(a, N) ≠ 1, then there is a nontrivial factor of N, so we are done.
  4. Otherwise, use the period-finding subroutine (below) to find r, the period of the following function:
    f(x) = a^x\ \mbox{mod}\ N,
    i.e. the order r of a in (\mathbb{Z}_N)^\times, which is the smallest positive integer r for which f(x%2Br) = f(x)
  5. If r is odd, go back to step 1.
  6. If a r /2 ≡ −1 (mod N), go back to step 1.
  7. gcd(ar/2 ± 1, N) is a nontrivial factor of N. We are done.

Quantum part: Period-finding subroutine

The quantum circuits used for this algorithm are custom designed for each choice of N and the random a used in f(x) = ax mod N. Given N, find Q = 2q such that N^2 \le Q < 2N^2, which implies Q/r > N. The input and output qubit registers need to hold superpositions of values from 0 to Q − 1, and so have q qubits each. Using what might appear to be twice as many qubits as necessary guarantees that there are at least N different x which produce the same f(x), even as the period r approaches N/2.

Proceed as follows:

  1. Initialize the registers to
    Q^{-1/2} \sum_{x=0}^{Q-1} \left|x\right\rangle \left|0\right\rangle
    where x runs from 0 to Q − 1. This initial state is a superposition of Q states.
  2. Construct f(x) as a quantum function and apply it to the above state, to obtain
    Q^{-1/2} \sum_x \left|x\right\rangle \left|f(x)\right\rangle.
    This is still a superposition of Q states.
  3. Apply the quantum Fourier transform to the input register. This transform (operating on a superposition of power-of-two Q = 2q states) uses a Qth root of unity such as \omega = e^{2 \pi i /Q} to distribute the amplitude of any given \left|x\right\rangle state equally among all Q of the \left|y\right\rangle states, and to do so in a different way for each different x:
    U_{QFT} \left|x\right\rangle
= Q^{-1/2} \sum_y \omega^{x y} \left|y\right\rangle.
    This leads to the final state
     Q^{-1} \sum_x \sum_y \omega^{x y} \left|y\right\rangle \left|f(x)\right\rangle.
    This is a superposition of many more than Q states, but many fewer than Q2 states. Although there are Q2 terms in the sum, the state \left|y\right\rangle \left|f(x_0)\right\rangle can be factored out whenever x0 and x produce the same value. Let
    \omega = e^{2 \pi i /Q} be a Qth root of unity,
    r be the period of f,
    x0 be the smallest of a set of x which yield the same given f(x) (we have x0 < r), and
    b run from 0 to \lfloor(Q-x_0-1)/r\rfloor so that x_0 %2B rb < Q.
    Then \omega^{ry} is a unit vector in the complex plane (\omega is a root of unity and r and y are integers), and the coefficient of Q^{-1}\left|y\right\rangle \left|f(x_0)\right\rangle in the final state is
     \sum_{x:\, f(x)=f(x_0)} \omega^{x y} = \sum_{b} \omega^{(x_0 %2B r b) y} = \omega^{x_0y} \sum_{b} \omega^{r b y}.
    Each term in this sum represents a different path to the same result, and quantum interference occurs—constructive when the unit vectors \omega^{ryb} point in nearly the same direction in the complex plane, which requires that \omega^{ry} point along the positive real axis.
  4. Perform a measurement. We obtain some outcome y in the input register and f(x_0) in the output register. Since f is periodic, the probability of measuring some pair y and f(x_0) is given by
     \left| Q^{-1} \sum_{x:\, f(x)=f(x_0)} \omega^{x y} \right|^2
= Q^{-2} \left| \sum_{b} \omega^{(x_0 %2B r b) y} \right|^2 = Q^{-2} \left| \sum_{b} \omega^{ b r y} \right|^2.
    Analysis now shows that this probability is higher, the closer unit vector \omega^{ry} is to the positive real axis, or the closer yr/Q is to an integer.Unless r is a power of 2, it won't be a factor of Q.
  5. Perform Continued Fraction Expansion on y/Q to make an approximation of it, and produce some c/r′ by it that satisfies two conditions:
    A: r′<N
    B: |y/Q - c/r′| < 1/2Q.
    By satisfaction of these conditions, r′ would be the appropriate period r with high probability.
  6. Check if f(x) = f(x + r′) \Leftrightarrow a^r \equiv 1 \pmod{N} If so, we are done.
  7. Otherwise, obtain more candidates for r by using values near y, or multiples of r′. If any candidate works, we are done.
  8. Otherwise, go back to step 1 of the subroutine.

Explanation of the algorithm

The algorithm is composed of two parts. The first part of the algorithm turns the factoring problem into the problem of finding the period of a function, and may be implemented classically. The second part finds the period using the quantum Fourier transform, and is responsible for the quantum speedup.

I. Obtaining factors from period

The integers less than N and coprime with N form a finite Abelian group G under multiplication modulo N. The size is given by Euler's totient function \phi(N). By the end of step 3, we have an integer a in this group. Since the group is finite, a must have a finite order r, the smallest positive integer such that

a^r \equiv 1\ \mbox{mod}\ N.\,

Therefore, N divides (also written | ) a r − 1 . Suppose we are able to obtain r, and it is even. (If r is odd, see step 5.) Now b \equiv a^{r/2} \pmod{N} is a square root of 1 modulo N, different from 1. This is because r is the order of a modulo N, so a^{r/2} \not\equiv 1 \pmod{N}. If a^{r/2} \equiv -1 \pmod{N}, by step 6 we have to restart the algorithm with a different random number a.

Eventually, we must hit an a, of order r in G, such that b \equiv a^{r/2} \not\equiv 1, -1 \pmod{N}. This is because such a b is a square root of 1 modulo N, other than 1 and -1, whose existence is guaranteed by the Chinese remainder theorem, since N is not a prime power.

We claim that d = \operatorname{gcd}(b-1, N) is a proper factor of N, that is, d \ne 1, N. In fact if d = N, then N divides b - 1, so that b \equiv 1 \pmod{N}, against the construction of b. If on the other hand d = \operatorname{gcd}(b-1, N) = 1, then by Bézout's identity there are integers u, v such that

(b-1) u %2B N v = 1.

Multiplying both sides by b%2B1 we obtain

(b^{2}-1) u %2B N(b%2B1) v = b%2B1.

Since N divides b^{2} - 1 \equiv a^{r} - 1 \pmod{N}, we obtain that N divides b%2B1, so that b \equiv -1 \pmod{N}, again contradicting the construction of b.

Thus d is the required proper factor of N.

II. Finding the period

Shor's period-finding algorithm relies heavily on the ability of a quantum computer to be in many states simultaneously. Physicists call this behavior a "superposition" of states. To compute the period of a function f, we evaluate the function at all points simultaneously.

Quantum physics does not allow us to access all this information directly, though. A measurement will yield only one of all possible values, destroying all others. But for the no cloning theorem, we could first measure f(x) without measuring x, and then make a few copies of the resulting state (which is a superposition of states all having the same f(x)). Measuring x on these states would provide different x values which give the same f(x), leading to the period. Because we cannot make exact copies of a quantum state, this method does not work. Therefore we have to carefully transform the superposition to another state that will return the correct answer with high probability. This is achieved by the quantum Fourier transform.

Shor thus had to solve three "implementation" problems. All of them had to be implemented "fast", which means that they can be implemented with a number of quantum gates that is polynomial in \log N.

  1. Create a superposition of states. This can be done by applying Hadamard gates to all qubits in the input register. Another approach would be to use the quantum Fourier transform (see below).
  2. Implement the function f as a quantum transform. To achieve this, Shor used repeated squaring for his modular exponentiation transformation. It is important to note that this step is more difficult to implement than the quantum Fourier transform, in that it requires ancillary qubits and substantially more gates to accomplish.
  3. Perform a quantum Fourier transform. By using controlled rotation gates and Hadamard gates Shor designed a circuit for the quantum Fourier transform (with Q = 2q) that uses just q(q-1)/2 = O((\log Q)^2) gates.[6]

After all these transformations a measurement will yield an approximation to the period r. For simplicity assume that there is a y such that yr/Q is an integer. Then the probability to measure y is 1. To see that we notice that then

e^{-2 \pi i b yr/Q} = 1\,

for all integers b. Therefore the sum whose square gives us the probability to measure y will be Q/r since b takes roughly Q/r values and thus the probability is 1/r^2. There are r y such that yr/Q is an integer and also r possibilities for f(x_0), so the probabilities sum to 1.

Note: another way to explain Shor's algorithm is by noting that it is just the quantum phase estimation algorithm in disguise.

Discrete logarithms

Suppose we know that x = g^r \pmod{p}, for some r, and we wish to compute r, which is the discrete logarithm: r = \log_g x \pmod{p}. Consider the Abelian group \left( \mathbb{Z}_{p} \right)^\times \times \left(\mathbb{Z}_p\right)^\times where each factor corresponds to modular multiplication of nonzero values, assuming p is prime. Now, consider the function

f(a,b) = g^a x^{-b} \pmod{p}.

This gives us an Abelian hidden subgroup problem, as f corresponds to a group homomorphism. The kernel corresponds to modular multiples of (r,1). So, if we can find the kernel, we can find r

In popular culture

On the television show Stargate Universe, the lead scientist, Dr. Nicholas Rush, hoped to use Shor's algorithm to crack Destiny's master code. He taught a quantum cryptography class at the University of California, Berkeley, in which Shor's algorithm was studied.[7]

Shor's algorithm was also a correct answer to a question in a Physics Bowl competition on the television show The Big Bang Theory.

References

  1. ^ See also Pseudo-polynomial time.
  2. ^ Vandersypen, Lieven M. K.; Steffen, Matthias; Breyta, Gregory; Yannoni, Costantino S.; Sherwood, Mark H. & Chuang, Isaac L. (2001), "Experimental realization of Shor's quantum factoring algorithm using nuclear magnetic resonance" (PDF), Nature 414 (6866): 883–887, arXiv:quant-ph/0112176, Bibcode 2001Natur.414..883V, doi:10.1038/414883a, PMID 11780055, http://cryptome.org/shor-nature.pdf 
  3. ^ Braunstein, S. L.; Caves, C. M.; Jozsa, R.; Linden, N.; Popescu, S.; Schack, R. (1999), "Separability of Very Noisy Mixed States and Implications for NMR Quantum Computing", Phys. Rev. Lett 83 (5): 1054–1057, arXiv:quant-ph/9811018, Bibcode 1999PhRvL..83.1054B, doi:10.1103/PhysRevLett.83.1054 
  4. ^ Lu, Chao-Yang; Browne, Daniel E.; Yang, Tao & Pan, Jian-Wei (2007), "Demonstration of a Compiled Version of Shor's Quantum Factoring Algorithm Using Photonic Qubits", Physical Review Letters 99 (25): 250504, Bibcode 2007PhRvL..99y0504L, doi:10.1103/PhysRevLett.99.250504 
  5. ^ Lanyon, B. P.; Weinhold, T. J.; Langford, N. K.; Barbieri, M.; James, D. F. V.; Gilchrist, A. & White, A. G. (2007), "Experimental Demonstration of a Compiled Version of Shor's Algorithm with Quantum Entanglement", Physical Review Letters 99 (25): 250505, Bibcode 2007PhRvL..99y0505L, doi:10.1103/PhysRevLett.99.250505 
  6. ^ Shor 1999, p. 14.
  7. ^ http://stargate.wikia.com/wiki/Shor%27s_algorithm

Further reading